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ABSTRACT 

Let  W^,  n = 0,1,...  be  the  time  until  the  nth  arrival  of  a 

Poisson  process  with  rate  0 . Using  invariant  loss  L ( 9,  9)  = 

"■2  ^ 2 

9 (9-9)  and  sampling  costs  involving  cost  per  arrival  and  cost 

* '' 

per  unit  time,  the  Bayes'  sequential  procedure  (N  , ) is  derived. 

N* 

The  large  sample  properties  of  the  procedure  are  then  studied  in  the 

ilr 

classical  framework,  and  N , the  stopping  time,  is  shown  to  be 

* 

asymptotically  equivalent  to  n , the  best  fixed  sample  size  procedure 
when  0 is  known.  Asymptotic  normality  of  the  sequential  estimator 
0 is  also  shown. 

N* 
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UiRGE  SAMPLE  PROPERFIES  OF  THE  BAYES ' SEQUENl’IAL 


PROCEDUl^  IX)R  ESTIMATING  THE  AlOUVAL  RATE  OF  A POISSON  PROCESS 
WITH  INV/ARIANT  LOSS 
C.  P.  Shapiro^  and  Robert  Kardrop^ 

1*  Introduct ion ■ Let  W^,  n = 0,1,...,  be  the  tine  until  tie  nth  arrival  of  a Poisson  process 

with  rate  0.  Take  = 0.  Conditional  on  0,  has  a gamma  distribution  with 

shape  parameter  n and  mean  n/0  (gamma (n , 0 )) . Let  F^,  n = 0,1,...,  denote 

the  sigma  algebra  generated  by  W^,  0 _<  i _<  n.  Sequential  estimation  procedures 

of  the  form  (N,6j^)  are  considered,  where  N,  the  number  of  arrivals  observed,  is 

a stopping  time  with  respect  to  F , and  0.,  is  an  f.,  measurable  random  vari- 

n N N 

able,  with  Fj^  the  sigma  algebra  of  events  prior  to  N. 

^ -2^2 

The  loss  due  to  estimation  is  L(0,9)  = 9 (9-0)  . Using  this  loss,  the  de- 
cision problem  of  estimation  of  0 is  invariant  under  the  group  of  scale  trans- 
formations (Ferguson,  1967).  Such  a loss  function  measures  the  estimation  error 

-2 

in  variance  units,  0 , and  forces  more  precision  at  small  values  of  0. 

The  cost  of  sampling  involves  two  components:  c^  = the  cost  of  observing 

one  arrival,  and  = the  cost  of  observing  the  process  for  one  unit  of  time. 

It 

In  Section  2,  the  Bayes*  sequential  procedure  (denoted  (N  ,9  ) throughout 
the  paper)  is  derived  in  Theorem  2.1  using  a gamma  prior  on  0 and  the  loss  and 
cost  structures  described  above.  In  Sections  3 and  4 the  large  sample  properties 
of  the  procedure  (N  ,C  ) are  examined  without  reference  to  the  Bayesian  origin 

N* 

it 

of  the  procedure.  The  limiting  form  of  N is  given  in  Theorem  3.1  and  the 

asymptotic  normality  of  6,,.  is  given  in  Theorem  4.1. 

N* 


^This  work  was  done  while  at  the  University  of  Wisconsin,  Madison,  and 
on  leave  from  Micliigan  State  University. 

^ University  of  Wisconsin-Madison , Department  of  Statistics. 


2.  The  Bayc-s'  Sequential  rrocodurc.  The  problem  of  finding  the  Bayes'  procediu  . 

(N  , 0 ) is  solved  as  follows.  For  a given  stopping  time  N , 0 is  the  HayC’S' 

N*  N 

estimator  of  0 given  f^.  The  optimal  choice  of  N is  then  obtained  by  findin>; 
that  stopping  time  which , minimizes  the  expected  total  cost  (Bayes'  risk  due  to 
estimation  plus  the  expected  cost  of  sampling) . See  DeGroot  (1970)  or  Cliow, 
Robbins,  and  Siegnund  (1971)  for  more  details. 

Suppose  0 has  prior  distribution  gamma  (0^,6^)  where  “q  ^ ^ ^ 

Then  the  posterior  distribution  of  0 given  is  ganuiia  with  = vi^^+n 

and  ~ ^0  ^ '^n'  ^sing  the  loss  function  given  in  Section  1,  the  Bayes'  esti- 

mator  of  6 given  f^  is  0^  = 6^  (a^-2) , and  the  expected  posterior  loss  usiiu) 

• ''I  -1 

®n  E [L (0 , 0^) I F^l  = (a^-1)  . Thus,  by  the  strong  Markov  property,  the  tota] 

cost  of  the  procedure  (N,0^^)  is 

The  Bayes’  procedure  minimizes  E(f7j^). 


* -1 
Define  stopping  rule  N = first  n > 0 such  that  c„6  + c.a  > a + c 

— TnAn— n 


A' 


Note  that  P (N  < “)  = 1,  and  that  the  rule  is  easy  to  use  since  both  q and  i' 

n n 

It 

have  a simple  form.  Also,  note  that  if  c,j,  = 0,  then  N is  a fixed  sample  size 

it 

stopping  rule  since  only  is  random  in  the  defining  expression  for  N . Due 

to  this  degeneracy,  assume  c^  > 0 henceforth. 

Lemma  2.1. 


i) 

If 

c^  > 0 , then 

* -1/2 
! - "a 

1 

ii) 

if 

c^,  > 0,  then 

N*  < (BqC^) 

-1 

Proof 

— 

Define  stopping  rules 

and 

^T 

by 

^A  = 

first  n>0 

such 

that 

‘^A^n 

Nt  = 

first  n>0 

such 

that 

^T^ 

Then 

it 

N 

^ min (N^,N^) . 

From  tlic 

definition 

of  N 

-1  ^ 

'«  + c. , 

- n A 

. -1 


either  N.  = 0 or  N,  - 1 
A A 

satisfies  the  reverse  inequality:  This  last  expression 

-1/2  A A 

implies  that  £ c^  ' . A similar  argument  applied  to  N.j,  gives  N,j,<(Bq  c.^,) 

Tlie  following  lemma  is  a technical  result  needed  in  the  proof  of  Theorem  2.1. 


\ ■; 
1 ' 
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LeiTiJiia  2 . 2 


If  N'  is  a stopping  rule  such  that  EC„,  < w then 

N 

lim  / c dP  = 0 . 

' n 

n + {n  ' > nl 


Proof ; 

C = (a  -1)  ^ + c.n  + c„W  . Thus,  E C„,<  “>  if  and  only  if  EN'<  and  EW^, , < 
nn  ATn  N N 

Consider  each  of  the  three  terms  separately. 

(i)  /(a  -1)  ^ dP  < (a  -1)  ^P[N'>n]  ->■  0 as  n “ . 

, ,n  — n 

{N’>n) 

00  oo 

(ii)  / c^ndP  = ^ / ndP  < c^  I kP(N’=k). 

{N’>n}  k=n+l{N’=k}  k=nH 

This  last  term  tends  to  0 as  n-f™  since  EN'  < . 

OO  oo 

(iii)  / c^W^dP  = I ! V'f^dP  1 c^  I j Wj^dP 

{N‘>n}  k=n+l  {N'=k)  k=n+l{N'=k} 

since  W < W,  for  k > n. 

This  last  term  tends  to  0 as  n-x®  since  EW^^,  < o». 

The  theorem  below  states  that  the  decision  procedure  (N  minimizes  t 

expected  total  cost  among  all  decision  procedures  (N',0j^,)  with  N'^O,  and  th.:r. 

* ^ 

(N  fOjj*)  is  the  Bayes’  procedure. 

Theorem  2.1.  If  (N'jOj^,)  is  a sequential  decision  procedure  then  C (r  ) £ C (c\. , ) . 


Proof.  The  cost  sequence  is  first  shown  to  be  in  the  monotone  case.  Note  tluii. 
given  0,  ~ '^n  ^ ^ ^ i®  exponential  (0)  and  independent  of  Thus 

Taking  and  u.sing  E(0~^lr^;  = i ; ' yields 

- (a  -i)~^  + c.  + c.^(a  -1)”^P  . 
n+1  'n  n n n A T n n 

* 

Thus,  the  cost  sequence  is  in  the  monotone  case  and  the  rule  N can  bo  cxprcs.sc; 

* 

as  N = first  n>0  such  that  E(c  ^,|c  ) > C . Now  since  c„,''0,  Lem-ma  2.1 

— n+1  n — n T 

implies  Ec^g,  < Let  N'  be  any  stopping  time  such  tliat  N'  ^0.  If  “ ''  t 

* 

then  N is  obviously  better.  If  < ~,  then  Lemma  2.2  allows  application  of 

the  monotone  case  theorem  (Chow,  Robbins,  Siegmund,  1971)  to  conclude  <E<.\., 
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* * 

3.  Large  sample  properties  of  N . In  this  section  the  stopping  rule  N is 

excunincd  in  the  classical  framework.  The  parameter  6 is  considered  fixed  but  un- 
known and  all  probabilities  and  expectations  are  conditional  on  6 and  denoted 

* -V 

Pg,  Eg,  respectively.  The  procedure  (N  /®jg*)  does  not  minimize 

0,  but  only  the  average  of  over  the  prior  distribution  of  Section  2. 

* '' 

The  large  sample  properties  of  the  procedure  (N  are  studied  by  letting 

the  sampling  costs  tend  to  zero.  Note  that  the  stopping  rule  N is  a function 

of  the  sampling  costs  c = • Define  n*  = n*(0)  = (c^  + 0"^)"^^^.  The 

* 

main  result  in  this  section  (Theorem  3.1)  is  that  N is  asymptotically  equi- 

* -1 
valent  to  n as  c tends  to  £ = (0,0)  with  c^c^  tending  to  Cg  £ As 

* -1 

motivation  for  this  limiting  form  of  N , compute  equal  to  (a^-1)  + c^n  + 

n c,p  0 where  the  expectation  is  conditional  on  0.  Let  H(x)  = (a^-1  ) ^ 

c,j,x  0 Then  H(x)  attains  a unique  minimum  at  x = (c^  + c^.o"’^)  + (0^-1). 

* 

Ignoring  the  (Og-l)  term,  this  minimum  is  n defined  above. 

The  following  lemmas  give  rates  and  uniform  integrabi 1 i ty  results  needed  in 
the  proof  of  Theorem  3.1.  Two  cases  are  considered  depending  on  the  limit,  Cg, 

of 

Lemma  3.1.  For  each  c > 0 , 


P(1 — 5-  -ll>  c)  < b(c)cxp 


....(,.-1/2 


D(c,r  ) 1 , 


where  b(c)  » bg  < ■»,  and  D(c,c)  >D(e)  < 0 and  finite,  as  c^,c^-*0  such  that 


Proof;  Pg( — - l>t)  = Pg(W|^<E|^),  where 

Wj^  = waiting  time  until  the  K^-  arrival,  = luj^c.^,) 


^a'-'t  " ''o' 


and  k = [(1+r.  )n  ) with  (•]  the  greatest  integer  function. 

For  all  t >0,  P^(Wj^<Bj^)  = Pg(exp(-tW^)  >exp(-t  Bj^)  ) < cxp(t  Bj^)  EgOxp  (-tV<Y_ ) 

= exi^d  Bj,  - k fn  (1  + t 0 ^)) 

by  Bernstein's  Inequality.  Since  [xj  2;^  x-1,  the  exponent  is 
£b^(c)  + (t  ,c , c ) , where 


i' 

y 


b (c)  = tn(l+tO  ) - 6pt  and 


D (t,c,c)  = 


t(y->  e.  1)'^^  e‘/^g^c)  tna^tQ-i) 


+ c) 


(c^c“^0  + 1)^/2 


(c^c‘^  e H-  1)^/2 


As  c^c~^  ►Cp  < “>,  D'*'(t,c,E)  - D'^(t,E)  > - ” for  all  t . But  o'^(t,c)  < 0 if 


and  only  if  0(l  + c)^> 


° "o 

l+c”^t“^iin  d+te"^) 


. The  right  hand  side  tends  to 


0 as  t ••  0.  Thus  there  exists  t^  such  that  D^(t,c)  < 0 for  all  t £ t^. 

* 

A similar  argument  yields  Pq  {^*-1  < -e ) < exp  (b“  (c)  + D~  (t , c , c ) ) , where 

D (t, £,£)-►  D (t,£)  > and  D (t,E)  < 0 for  all  t£t  , for  some  t . The 

proof  is  completed  by  setting  t^  = min(t^,t~)»  D(c,e)  = max (D^ (t^ , c , c ) , 

D (tp.c.c)),  and  b(c)  = 2max(exp(b  (c) ) , exp  b^ (c) ) . 


Lemma  3.2.  For  each  r. , 0 <c<l 

„ , I N*  , I . 


Pgd-^  - l|>  c)  £ b(c)exp  c^^^  c~^D(c,c) 


where  b(c)  "*■  b^  < and  D(c,c)  ->■  D(c.)  < 0 and  finite,  as  c,^,c^->0  such  that 

r*  C ^ cr» 

a^'t 

Proof:  Tlie  techniques  here  are  similar  to  those  of  Lemma  3.1.  P„ ( ^ - 1 < -c) 

U 

★ 

Pq  (Wj^  > Bj^)  , where  Bj^  is  defined  in  the  proof  of  Lenuna  3.1  and  k = [(l-c)n  ] 

Following  Lemraa  3.1,  for  all  t < 6 Pg  (Wj^  > Bj^)  £ exp(-t  Bj^-k  fn(l  - tO  ^)), 

< exp(b  (c)  + c,j,^  D (t,£,r))  where  b (c)  = -J-n(l  - tO  + B^t  and 

-t(f+c  ,,,  0^/2(l-e) (t-c  c7^  fn(l-t3-l 

D (t,c,r)  = -jy^ ^ tcf^a  ^ 

0^/^(l-£)  + (ag+l)  (c  o+c  ) " (G+c  c 


^s  c^tc^  -+ 0 and  c^c^  ^ , D (t,c,r)  ->  D (t,c)  < 0 and  finite  for  all  e <1. 

N* 

Similar  methods  applied  to  P„  ( -1  > c)  complete  the  proof. 

" n 

Lemma  3.3.  If  c,,c,„  -*  0 such  that  c.c„  •*•  Cr,  < «>  , then 
A T AT  0 — 

* ♦ 

(i)  N /n  is  uniformly  intcgrr.Mc  (Pq)  » and 

* * 

(ii)  n /N  is  unifcjrmly  intcgrable  (P„). 


Proof ; 

(i)  Take  a > 1 + {.  Suppose  Cq  < Then  Lemma  3.1  implies 

* N*/n*dP  < P„(NVn*  >1  +c) 

{N  /n  >a}  e - 0 T 0 

“1  -1  Z') 

< (3qC.j,)  b(c)  expCc.^  ^ D(c,t)  ) , 

which  tends  to  zero  as  c -►  0.  If  = «>,  then  N*/n*  < (1+c.pC”^?”^) 

. — 1 * * 

Thus,  since  ^.^c^  ->  0 , N /n  is  uniformly  bounded  in  c and  hence  uniformly 

integrable . 

, • • . * * * * 

(ii)  N ^ 1 implies  n /N  £ n . Thus, 

/ n /N  dPg  < n P (n  /N  > a)  = n P^(N  /n  < a ). 

{n  /N  > a) 

Take  a > (1-e)  Then  if  Cq  Lemma  3.1  implies  the  last  expres- 

sion is 

<(c^c‘^  + e"^)‘^'^^c“^/2b(c)  exp(c~^/^D(c,E)  ) , 
which  tends  to  0 . If  then  heirmia  3.2  implies  the  last  expression 

is 

1 (1  + c^^CpO  b(c)  oxp(cj^^^c”/D(c,  I ) ) , 

which  tends  to  0 . 

Theorem  3.1.  If  c^.c^  >■  0 such  that  ■>  c^  < “>,  then 

(i)  N /n  ->■  1 a.s.  (Pg)  , 

* 

(ii)  Eg-^-^1. 

Proof : (ii)  follows  from  (i)  and  the  uniform  integrabil  ity  shov.-n  in  Lerxia  3 . . 

To  prove  (i),  a f^orcl  Cantclli  tyi^e  argument  is  used  along  v.'ith  monotirit;>' 

* * 

properties  of  N and  n . Lot  c(k)  bo  a sequence  of  costs  decreasing  co- 

ordinatowise  to  0 such  that  Cp(k)”^  tends  to  Cg  < ,»  as  k . 

* * * * 

For  simplicity,  let  Nj^  and  nj^  denote  N ,n  respectively  when  cost  c(l.)  is 

* * 

used.  From  the  definitions  of  N and  n , 

'"k  ^ •'c  1 ""k+l  "k  1 "c  1 "k+1  ' 

for  all  c in  [c  (k  + 1 ) , c (k) ) where  containment  is  coordinatewise . Fix  c >0. 
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* * 

Then  Pq (N  /n  > 1 + e for  some  c < c (m) ) is 

“ * * 

£ I Pg (N  /n  > 1 + e for  some  c in  [c  (k+1) ,c (k) ] ) 
k=m 


k=m  " 


* * * * 

Since  tends  to  1 , choose  e'  = e/2(l+c),  and  m such  that  + l 


> 1-e*  for  all  k £ m.  Then  the  probability  above  is 


1 J Pe  < 

k=m 


0 ( > 1 + (t/2)) 


which  tends  to  zero  as  m ► “ from  the  exponential  rates  derived  in  Lenuias 
3.1  and  3.2. 

The  limiting  form  of  E„c  can  be  derived  as  a corollary  to  Tlicore."  3.2 

6 N* 


Corollary  3.1.  If  c^,c^  -»■  0 such  that  c^c^ 

* 

n Ef,C  ^ 2. 

0 N* 

Proof;  = Eg(a^_^-1)"^  + (c^  + c^0"^)EgN*,  and 


= ^6  + EgN*/n*  . 

The  last  term  tends  to  1 by  Theorem  3.1.  Also,  by  Theorem  3.1 

* -1  * _i  * * 

n (a  -1)  1 a.s.  (P„)  . But  n (a  ^-1)  < n /N  which  is  unifon.  ly 

N*  U N*  — 

integrable  by  Lemma  3.3.  Thus,  the  first  term  tends  to  1. 


• I 


: I 

k! 


4.  Asymptotic  normality  of  ^ concludinq  remarks.  Once  the  limiting 

* N* 

form  of  N is  found,  asjTnptotic  properties  of  6^^*  can  be  obtained  by 
standard  methods . 

Lemma  4.1.  Suppose  Xj^,X2,...  are  independent  and  identically  distributed  with 

mean  0 and  variance  1 , and  that  N'  is  a stopping  time  tending  to  " as 

Scunpling  costs  tend  to  0 . If  there  exists  n',  nonrandom,  such  that  N'/n'  1 

(in  probability)  then 
1 

^^2  I X^  -►  Z (in  distribution)  , 


^N-  = 


(N'  ) 2 

where  Z is  normal  with  mean  0 and  variance  1 . 


Proof ; The  result  is  well  known  (Renyi,  1957). 


Theorem  4.1.  If 


such  that  c.c  ^ ->  c.  < "■  , then 

A 1 0 


*1/2 

(N  ) ' ->  Z (in  distribution) 

e 

where  Z is  normal  with  mean  0 and  variance  1 
Proof : 


Pg((N  ) 


1/2 


0 


< X ) 


*1/2  * -1  *1/2-1 

^ (aQ-2+N  )0  ^(>:+(N  - Bq) 


where 


and 


Pq^V  1 - )), 


—-^72  (W  * - N*0'^) 
(N*)  ' 


*1/2  *1/2-1  * -1/2 
(N  )'^'^^x(x+(N  ^ +y.~  ) 


* * 1/2  -1 
R(N  ) = (cxq-2)  ( x+  (N  )^^^) 

Fix  ■->0.  Then  > “ >^  + R(K*))  is 

1 Po^V  1 r(,(R(:«I*)<  -O  . 

From  Lenmia  4.1,  Y •*  Z (in  distribution),  and  thus  the  fir.st  term  .above  tends 
N* 

to  1 -C  (-X-  ) - <;■  (x+  ),  where  >;!(•)  is  the  di  str  i liut  ion  function  of  a standard 

, implies 


* * * 

normal  random  variable-.  But  N /n  -*  1 (a.s.  P^ ) and  n ► 

* ^ 

R(N  ) » 0 (a.s.)  and  hence,  Pq(R(N  ) < -()  -»  0 . Noting  that  c is  arbitrary 


completes  the  proof. 


-8- 


Althouloh  Bayesian  methods  are  used  to  derive  the  procedure  (N 
the  procedure  has  desirable  properties  in  the  classical  framework,  and  these 
properties  are  independent  of  the  prior  distribution  in  Section  2.  In  parti- 
cular, n*  is  approximately  the  best  fixed  sample  size  procedure  if  0 is 

it  1c 

known.  Thus,  the  asNTnptotic  equivalence  of  N and  n is  a strong  result. 
Once  this  equivalence  is  proven,  usual  properties  of  fixed  sample  size  esti- 
mators will  hold  for  the  sequential  estimator  of  0 (as  shown  in  Theorem  4.1). 

The  inclusion  of  two  type  of  costs  in  this  problem  is  much  more  realistic 
than  the  simple  cost  per  arrival.  Also,  when  only  cost  per  arrival  is  con- 
sidered, the  best  sequential  procedure  is  a fixed  sample  size  procedure.  How- 
ever, with  the  inclusion  of  time  cost,  the  best  procedure  is  no  longer  a fixed 
sample  size  procedure,  and  Theorem  3.1  shows  how  these  two  costs  are  weighted 
asymptotically  in  determining  the  sample  size. 
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